z-squared: The Origin and Application of χ2
نویسنده
چکیده
A set of statistical tests termed contingency tests, of which χ is the most well-known example, are commonly employed in linguistics research. Contingency tests compare discrete distributions, that is, data divided into two or more alternative categories, such as alternative linguistic choices of a speaker or different experimental conditions. These tests are highly ubiquitous, and are part of every linguistics researcher’s arsenal. However the mathematical underpinnings of these tests are rarely discussed in the literature in an approachable way, with the result that many researchers may apply tests inappropriately, fail to see the possibility of testing particular questions, or draw unsound conclusions. Contingency tests are also closely related to the construction of confidence intervals, which are highly useful and revealing methods for plotting the certainty of experimental observations. This paper is organised in the following way. The foundations of the simplest type of χ test, the 2 × 1 goodness of fit test, are introduced and related to the z test for a single observed proportion p and the Wilson score confidence interval about p. We then show how the 2 × 2 test for independence (homogeneity) is derived from two observations p1 and p2 and explain when each test should be used. We also briefly introduce the Newcombe-Wilson test, which ideally should be used in preference to the χ test for observations drawn from two independent populations (such as two subcorpora). We then turn to tests for larger tables, generally termed “r × c” tests, which have multiple degrees of freedom and therefore may encompass multiple trends, and discuss strategies for their analysis. Finally, we turn briefly to the question of differentiating test results. We introduce the concept of effect size (also termed ‘measures of association’) and finally explain how we may perform statistical separability tests to distinguish between two sets of results.
منابع مشابه
Inequalities for the polar derivative of a polynomial with $S$-fold zeros at the origin
Let $p(z)$ be a polynomial of degree $n$ and for a complex number $alpha$, let $D_{alpha}p(z)=np(z)+(alpha-z)p'(z)$ denote the polar derivative of the polynomial p(z) with respect to $alpha$. Dewan et al proved that if $p(z)$ has all its zeros in $|z| leq k, (kleq 1),$ with $s$-fold zeros at the origin then for every $alphainmathbb{C}$ with $|alpha|geq k$, begin{align*} max_{|z|=...
متن کاملNonlinear Optical Properties of Rigid Polyurethane Foam/SiO2 Nanocomposite
Polyurethane closed cell (PUCC)/SiO2 nanocomposites have been prepared by using in situ polymerization approach. The third-order optical nonlinearities of PUCC/SiO2 nanocomposites, dissolved in DMF are characterized by Z-scan technique at the measurement wavelength of 532 nm. The nonlinear refractive (NLR) indices and nonlinear absorption (NLA) coefficients of samples were calculated from close...
متن کاملA matrix method for estimating linear regression coefficients based on fuzzy numbers
In this paper, a new method for estimating the linear regression coefficients approximation is presented based on Z-numbers. In this model, observations are real numbers, regression coefficients and dependent variables (y) have values for Z-numbers. To estimate the coefficients of this model, we first convert the linear regression model based on Z-numbers into two fuzzy linear regression mode...
متن کاملApplication of glauconite and fossil palynomorphs in reconstructing the Liassic paleogeography just before the opening of the Gulf of Mexico
Red beds, conglomerates and salt were considered azoic and problematic rocks, but Paleopalynology and Inorganic Geochemistry proved to be useful for placing them in time and space. In the early last century, in Mexican NE region, only three Mesozoic red bed units were differentiated, dated as Late Triassic to Late Jurassic. It was important tratigraphically to place them properly as they were c...
متن کاملPreparation and Application of Al3+ - Sensor Based On (2Z) — Methyl 2 — ((z) (p-tolylimino) -3-Ethyl —4-0xothiazolidin —5— Ylidene Acetate in PVC Matrix
Al3+-Potentiometric sensor, based on (2Z) -methyl 2- ((z) (p-tolylimino)-3-ethyl -4-oxothiazolidin -5- ylidene) Acetate (MTEOY) as a neutral ionophore, was successfully developed for the detectionof Al3+ in aqueous solutions. The electrode responds to Al3+ ion with a sensitivity of 19.8 ± 0.1 mV/decade over the range 1.0 x 10-8- 1.0 x 10-1 mol LT' and in a pH range of 3.0-9.0. The electrodeshow...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Journal of Quantitative Linguistics
دوره 20 شماره
صفحات -
تاریخ انتشار 2013